Representing Periodic Structure in Speech
نویسندگان
چکیده
An eigenvalue method is developed for analyzing periodic structure in speech. Signals are analyzed by a matrix diagonalization reminiscent of methods for principal component analysis (PCA) and independent component analysis (ICA). Our method—called periodic component analysis ( CA)—uses constructive interference to enhance periodic components of the frequency spectrum and destructive interference to cancel noise. The front end emulates important aspects of auditory processing, such as cochlear filtering, nonlinear compression, and insensitivity to phase, with the aim of matching the robustness of human listeners. The method avoids the inefficiencies of autocorrelation at the pitch period: it does not require long delay lines, and it correlates signals at a clock rate on the order of the actual pitch, as opposed to the original sampling rate. We derive its cost function and present some experimental results.
منابع مشابه
Periodic Component Analysis: An Eigenvalue Method for Representing Periodic Structure in Speech
An eigenvalue method is developed for analyzing periodic structure in speech. Signals are analyzed by a matrix diagonalization reminiscent of methods for principal component analysis (PCA) and independent component analysis (ICA). Our method—called periodic component analysis (πCA)—uses constructive interference to enhance periodic components of the frequency spectrum and destructive interferen...
متن کاملInvestigating the formal effect of rear wall structure on acoustic parameters of speech halls (Research Article)
Referring to the rear wall in a hall is the furthest element rather than the voice source, therefor the reflections of this structural member play important role in music and speech intelligibly, especially for one-third behind audiences. Hence the form of these structures can be very effective in the acoustical quality of speech halls and auditoria. In this study, four formic structures are ex...
متن کاملUniform concatenative excitation model for synthesising speech without voiced/unvoiced classification
In general, speech synthesis using the source-filter model of speech production requires the classification of speech into two classes (voiced and unvoiced) which is prone to errors. For voiced speech, the input of the synthesis filter is an approximately periodic excitation, whereas it is a noise signal for unvoiced. This paper proposes an excitation model which can be used to synthesise both ...
متن کاملAN INDEX REPRESENTING STRUCTURE-CATABOLIC FATE RELATIONSHIPS OF AMINO ACIDS
Based on the Randic suggestion of the resolution of a structure into shape, size, and function an index representing structure-catabolic fate relationships of amino acids is constructed. The index obtained by multiplying three factors; (nb2 + nr), nc and M; representing shape, size and function respectively, where nb2 = number of double bonds, nr = number of rings, nc = number of carbon atoms, ...
متن کاملDecomposition of Speech into Voiced and Unvoiced Components Based on a Kalman Filterbank
We present a novel method for decomposing speech into signals representing the voiced and unvoiced components of speech. The method involves first demodulating the variations in spectral envelope, energy and pitch, and then applying a bank of Kalman filters to separate the harmonic and non-harmonic components of the signal. The use of Kalman filters relies on a state-space representation of the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007